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We present a methodology for detecting non-linearities in data sets based on the characterization 
of the structural features of the Fourier phase maps. A Fourier phase map is a 2D set of points 
M = {(4>%, 't'k+A)}' where 0£ is the phase of the fc-mode of the Fourier transform of the data set 
and A a phase shift. The information thus rendered on this space is analyzed using the spectrum 
of weighted scaling indices to detect phase coupling at any scale A. We propose a statistical test 
of significance based on the comparison of the properties of phase maps created from both the 
original data and surrogate realizations. We have applied our method to the Lorenz system and 
the logarithmic stock returns of the Dow Jones index. Applications to higher dimensional data are 
straightforward. The results indicate that both the Lorenz system and the Dow Jones time series 
exhibit significant signatures of non-linear behavior. 
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The first step in the characterization of a data set is 
usually the calculation of the 'linear properties', i.e. the 
power spectrum and the amplitude distribution. How- 
ever, in many cases the estimation of higher-order prop- 
erties is required. For instance, given an image it is possi- 
ble to generate from it a new one by shuffling the Fourier 
phases. The resulting image may look different although 
the phase shuffling process preserves the power spectrum 
of the original one. Then, the Fourier phases contain in- 
formation which is beyond the linear properties of the 
data, the so-called non-linear properties. A challenging 
problem is then the extraction and characterization of the 
information contained in the Fourier phases. The phases 
are a powerful indicator of the structure of a data set. ft 
was early noticed that an image synthesized by keeping 
the Fourier amplitudes and changing the Fourier phases 
retains almost no feature of the original image. How- 
ever, image synthesized by keeping the Fourier phases 
and changing the Fourier amplitudes conserve most of the 
structure observed on the original image []]. Studies of 
Fourier phase coupling are mostly found in the field of as- 
trophysics where the aim is to characterize the growth of 
large-scale structure in the universe 0, and to test for 
non-Gaussian signatures in the Cosmic Microwave Ra- 
diation Background Q. Except for these few cases, the 
methods to test for non-linearities have never focused 
on the analysis of phase correlations, although some of 
them are based on phase randomization procedures. This 
is probably due to difficulties in constructing good es- 
timates out of phases since they are circular quantities 
(defined modulo 2tt) and not translational invariant, as 
was already noted in In this work, we present a 

method to analyze the Fourier phase information based 
on the assessment of the so called 'Fourier Phase Maps'. 
First, the method of surrogates [E IS HI El is use d 
to generate an ensemble of data sets which mimic the 
linear properties of the original data set however wip- 
ing out higher-order correlations. Then, we generate for 



both the original data set and the surrogate data phase 
maps which are subsequently characterized b y m eans of 
the spectrum of weighted scaling indices [HJ, LLJj ■ If the 
computed measure for the original data is significantly 
different from the value obtained for the surrogate data 
set, one can infer that the data were generated by a non- 
linear process. 

Consider an ./V-dimensional data set {xl} with Fourier 
phases {<f>t}- Given a phase shift A a phase map 
is defined as the two dimensional set of points M = 
^ n possible wave numbers fc are consid- 
ered up the the Nyquist frequency so as not to include 
redundant information. The phase maps posses attract- 
ing features which are useful to reveal non-linear prop- 
erties. First, it is well known that the Fourier transform 
of a random Gaussian variable has uncorrelated and uni- 
formly distributed phases in the interval [— 7r,7r]. Then, 
the plot of the phase map generated by such a vari- 
able will be a uniform point distribution on the square 
bounded by y = ±7r and x = ±7r. Here, we gener- 
ate surrogate data sets using the iterative amplitude ad- 
justed Fourier transform algorithm (ITAAFT) |7j,|8j. By 
means of this method, we generate surrogate data sets 
which keep the same amplitude distribution in real space 
and power spectrum of the original data set however de- 
stroying higher-order correlations. It has been recently 
demonstrated that this algorithm can be generalized for 
two and three dimensional data sets fl3l | . The surro- 
gate method destroys higher-order properties by means 
of a phase randomization procedure. Thus, we expect the 
phase maps of surrogate data to be more uniform. This 
is actually the phase map feature which will be charac- 
terized in order to develop a test for non-linearities. 

We have used the Scaling Index Method (SIM) to 
quantify the structural features of the phase maps. This 
technique which was inspired in the analysis of nonlinear 
systems 0, has been successfully applied in different 
fields of research, ranging from astrophysical to medical 
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applications [3 GH 13 13 E3 

The SIM characterizes 
the structural features of a point distribution by means 
of the analysis of its local scaling behavior. 

We first define a local weighted cumulative point dis- 
tribution p as p(xi,R) = J2j s R{d-(^i^j))i where sr(») 
denotes a kernel function which depends on the scale pa- 
rameter R and a distance measure In principle, any 
differentiable kernel function and any distance measure 
can be used. The weighted scaling indices a(xi,R) are 
then obtained by calculating the logarithmic derivative 
of p(xi, R) with respect to R, 

m d\ogp(xi,R) R d 

a( Xi ,R) = dlogR = - m p{*i,m ■ (1) 

We use the Euclidean norm as distance measure and a 
set of exponential shaping functions. So, the expres- 
sion for p simplifies to p(xi,R) — e "^■~^~' >q , where 
dij = \\xi — Xj\\. The exponent q controls the weighting 
of points according to their distance to the point where a 
is calculated. In this study, we use q = 2. The weighted 
scaling indices can then be written as 



through the following expression 



a(xi,R) 



Ej2( ^)2 e -(^) 



(2) 



It should be noted that the use of shaping functions has 
two advantages, namely (i) one obtains an analytical ex- 
pression for a(xi,R), and (ii) the scaling region is only 
determined by the parameter R. The scaling-index ai 
characterizes the structural surrounding of afj. The prob- 
ability P(a)da — Prob(a G [a, a + da]) is therefore a sta- 
tistical measure of the distribution of elementary struc- 
tural components. The choice of the scaling region is not 
a trivial task and is crucial to obtain meaningful results 
using this technique. Although one is normally guided by 
the size of the substructures under investigation, various 
scaling ranges should be considered in order to obtain 
meaningful results. 

Consider a 2D nearly uniform phase map. Since the 
scaling index a describes the local scaling behavior, most 
of the a- values will be close to a = 2, leading to a nearly 
Gaussian frequency distribution centered at a = 2. How- 
ever, if the phase map contains structure, the frequency 
distribution will show a weaker signal around a — 2 and 
new a-values will appear. The circular property of the 
phases implies that the embedding square is topologically 
equivalent to a torus. Then, the calculation of scaling in- 
dices was performed using periodic boundary conditions. 

We propose a statistical test of non-linearities based 
on the P(a) spectrum. First, we generate surrogate data 
sets using the ITAAFT algorithm. Then, we create for 
the original and the surrogate data sets phase maps for 
phase shifts A = {Ai, • • • , A m }, where A m is the maxi- 
mum phase shift considered. The significance is defined 



P (q,A ; )- < P s (a,AQ > 
a(P.(a,£t)) 



I = 1, • • • , m, 



(3) 

where P is the frequency distribution of the original 
phase map, and < P s > and c(P s ) are the mean value 
and the standard deviation over the frequency distribu- 
tions of the phase maps of the surrogate realizations. 
Equation @ is meaningful only when P s (a, A) are Gaus- 
sian distributed. Although there is no proof for this state- 
ment, simulations show that mixing processes satisfy this 
condition. Then, S is measured in units of standard devi- 
ations. As mentioned above, we expect that the original 
(non-linear) data set leads to wider P{a) spectra while 
the surrogate data sets will produce P(a) spectra with 
a stronger signal around a = 2. Then, the statistical 
test will give a positive result if the following condition 
is satisfied 

6 fa : S(a, A) < -2.6 A 1-90 < a < 2.1, 
la: S(a, A) > 2.6 /\ a elsewhere, 

where the value 2.6 corresponds to 1% of the quantile of 
the normal distribution. However, even when the con- 
dition given by Eq. Q holds we are not certain that 
original and surrogate phase maps are significantly dif- 
ferent since the scaling index is a local structure measure. 
In fact, every scaling index value corresponds to a region 
on the phase map. Thus, Eq. (@J determines the region 
of the phase maps where we can significantly differentiate 
between the original and surrogate phase maps. In the 
case that this region is tiny, we can not state that the 
original and the surrogate phase maps are significantly 
different. In order to address this problem, we define a 
new quantity S through the following expression 



S(A) = £p o (a,,A) 



(5) 



where ai are the a-values that satisfy Eq. Q. It should 
be noted that < 3 < 1. 

Thus, 5 can be interpreted as the probability to sig- 
nificantly distinguish between the original and the sur- 
rogate phase maps. The probability S accounts for both 
the significance and the extent of the region where a non- 
uniform structure exists. 

We have applied our statistical test to two time se- 
ries, namely the z-component of the Lorenz system in a 
chaotic regime and the logarithmic stock returns of the 
Dow Jones. In both cases, we generate 20 surrogates us- 
ing the ITAAFT algorithm and phase shifts in the range 
A = {!.,•••, 100} are considered. The phase map struc- 
ture is analyzed using the SIM with R = 0.8. 

Figure 1(a) shows the time series of the 2-component of 
the Lorenz system in a chaotic regime and a surrogate re- 
alization. One can clearly observe differences between the 
time series although they have not only the same ampli- 
tude distribution but also the fluctuations show the same 
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Figure 1: a) Time series of the z-component of the Lorenz 
system in a chaotic regime a = 10, r = 28, and b — 8/3 
and below a surrogate realization, b) Log-log plot of the scal- 
ing of the fluctuations obtained using the DFA2. The curves 
have been shifted to allow for comparison, c) Phase map of 
the Lorenz system for A = 1. d) Phase map of a surrogate 
realization for A = 1. 
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Figure 2: a) Time series of the logarithmic daily returns of 
the Dow- Jones for the period 1930-2003. Below, a surrogate 
realization, b) Log-log plot of the scaling of the fluctuations 
obtained using the DFA2. The curves have been shifted to 
allow for comparison, c) Phase map of the logarithmic returns 
of the Dow Jones for A = 1. d) Phase map of a surrogate 
realization for A = 1. 



scaling behavior (see Fig. 1(b)). The scaling behavior of 
the fluctuations was obtained using the Detrended Fluc- 
tuation Analysis of order 2 (DFA2) 01 • This analysis is 
equivalent to the power spectrum analysis [l^. Figure 
1(c) and 1(d) show phase maps for these time series. We 
observe that the phase map of the original time series dis- 
plays high and low density regions. Although the phase 
map of the surrogate time series is actually more uniform, 
it still displays a great deal of structure. In fact, most of 
the the surrogate realizations of the Lorenz system gener- 



Figure 3: Frequency distribution of a- values, a) Lorenz sys- 
tem, b) Dow Jones index. In both cases, the full line (dashed 
line) indicates the original phase map (surrogate phase map). 



ated using the ITAAFT algorithm display phase coupling 
and only some of them are free from phase correlations. 
Figure 3(a) shows the frequency distribution of scaling 
indices P{a) for the phase maps shown in Fig. 1. P(a) 
of the surrogate realization is narrower and more concen- 
trated around a = 2 which is the value expected for a 
uniform distribution. Figure 4(a) shows the significance 
for a phase shift A = 28. We observe that a high signifi- 
cance is obtained for a wide range of scaling index values. 
The lines at S = 20 were included in intervals where di- 
vergencies appear. Figure 4(c) shows the probability 2 
versus A. It indicates that the probability to significantly 
differentiate between the original Lorenz time series and 
the surrogates oscillates around 5 ~ 0.40. As expected, 
the Lorenz system shows signatures of non-linear behav- 
ior. 

Several studies of single stock have focused on the sta- 
tistical analyses of the dynamics to model the financial 
markets [2jj. It has been noticed that economic indices 
exhibit a non-linear behavior [2l|, [2^ which share some 
qualitative features with turbulence [2^, I24I |25| . How- 
ever, understanding the process that underlies the macro- 
scopic behavior of the stocks remains at a speculative 
level. As noticed above, the Fourier phases are a pow- 
erful indicator of the data structure. It is then relevant 
to unveil and quantify the properties of the phases for 
stock indices. Typically, the economic indices show a 
correlated short-time behavior (few days) which crosses 
over to an uncorrelated asymptotic behavior. Figure 2(a) 
shows the logarithmic returns of the day to day closing 
price of the Dow Jones index in the period 1930-2003 |2|j 
and a surrogate realization. Figure 2(b) shows the scal- 
ing behavior of the fluctuations. The asymptotic regime 
is governed by an exponent 7 ~ 0.5 which is the signature 
of uncorrelated behavior [Ig. However, the phase maps 
shown in Fig. 2(c) and 2(d) reveal a new scenario. The 
phase map for the original Dow Jones index displays a 
high density diagonal band and low density regions. On 
the other hand, the phase map for the surrogate real- 
ization looks quite uniform. In this case, the ITAAFT 
algorithm always generates surrogates free from phase 
coupling. Figure 3(b) shows the frequency distribution 
of scaling indices for the original and the surrogate phase 
maps. The spectrum of scaling indices for the phase map 
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Figure 4: a) Significance for Lorenz system. The dashed 
horizontal lines at S — ±2.6 indicate 1% of the quantile of 
the normal distribution, b) Idem for the Dow Jones index, c) 
Probability to significantly differentiate between original and 
surrogate phase maps for the Lorenz system, d) Idem for the 
Dow Jones index. 

of the surrogate realization shows a sharp peak around 
a = 2. As occurred for the Lorenz system, Fig. 4(b) 
shows that high S values are obtained for a wide range 
of scaling indices. Figure 4(d) indicates that there is 
a high probability to significantly differentiate between 
the original Dow Jones index and the surrogate realiza- 
tions for phase shifts up to A c ~ 20. Below A C! the 
non-linearities appear to be stronger than in the Lorenz 
system. However, increasing the phase shift A the prob- 



ability H almost vanishes. 

We have developed a new test for non-linearities based 
on the characterization of the Fourier phase maps using 
the SIM. As expected, the Lorenz system showed signa- 
tures of non-linear behavior at all A scales. However, we 
have noted that for this time series the ITAAFT algo- 
rithm is not always able to generate surrogates free from 
phase coupling. Then, our method could also be used to 
assess the quality of surrogate data sets since the perfor- 
mance of the generating algorithms is data dependant. 
Even though the fluctuations of the Dow Jones index 
display an uncorrelated asymptotic regime, this time se- 
ries contains non-linearities which seem to be stronger 
than in the Lorenz system. This is probably because the 
ITAAFT algorithm is unable to generate surrogates free 
from phase coupling for the Lorenz system. Our results 
indicate that a novel characteristic scale of non-linearities 
A c exists for the Dow Jones index. These findings may 
be useful to deeper understand the market dynamics and 
thus improve the results of risk assessments. 

This method can be applied to higher dimensional data 
sets since algorithms to generate surrogates in higher di- 
mensions are available |l2l Il3| . In cases where weak non- 
linearities may be present as in the Cosmic Microwave 
Background of radiation , the use of local structure 
measures for the assessment of the phase maps, like the 
scaling indices, instead of global measures may be more 
appropriate since global measures may average important 
local details of the maps. The development of quite sen- 
sitive tests is particularly relevant for this problem where 
the presence or absence of non-Gaussian signatures will 
support different evolutionary theories of the universe. 
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